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TTTT E OF THF. INVENTION 

EN HANCED PROTEIN PRODUCTION IN HIGHER PLANTS BY N-TERMINAL FUSIO N_ 
•OFAUmWITrN ^ MOSAIC VIRUS COATPROTEIN PEPTIDE ^ 

5 ~" ' 

RACK GROUND OF THE INVENTION 

Strategies for production of proteins in heterologous fusion form have been widely 

applied in biotechnology for many purposes, such as secretion of proteins from host cells (fused 

to signal peptides), easy detection or purification of protein products (fused to reporter enzymes 

1 0 for detection and to peptide tags for purification), searching for proteins with desired biological 
activities (e.g., in the phage display technique and the two-hybrid system). Enhanced expression 
of proteins of interest has also been achieved by N-terminal fusion of a small peptide to the target 
protein. Fusion of a ubiquitin gene together with a ubiquitin promoter to the 5'-end of a gene of 
interest is one of the systems which has been used to enhance protein expression. Ubiquitin 

15 exists in all eukaryotic cells and is the most highly conserved protein yet identified. It is 
abundant in cells and exhibits profound stability to heat and proteolytic degradation. Moreover, 
ubiquitin precursors, that is, polyubiquitin where ubiquitin monomers are linked up head to tail 
and ubiquitin extension proteins where a single ubiquitin is appended at its C-terminus to either 
of two small ribosomal proteins, undergo rapid processing by ubiquitin C-terminal hydrolases, 

20 which cleave C-terminal of the ubiquitin moieties and release the free ubiquitin monomer and 
the C-terminal extension proteins. All of these features have rendered ubiquitin as an excellent 
N-terminal fusion partner to augment target protein accumulation in genetic engineering. 

The ubiquitin fusion approach was first developed by Butt et al. (1989), who showed that 
fusion of ubiquitin to yeast metallothionein or to the a subunit of the adenoylate cyclase- 

25 stimulatory GTP-binding protein increased the yield of these otherwise unstable or poorly 
expressed proteins from undetectable levels to 20% of the total cellular proteins in E. coli. Ecker 
et al. (1989) demonstrated that in yeast, ubiquitin fusion resulted in enhanced expression of three 
mammalian proteins by up to 200-fold and all these ubiquitin fusion proteins were correctly 
processed by yeast ubiquitin-specific endopeptidase to release authentic functional proteins. A 

30 similar yeast ubiquitin fusion expression system was reported by Sabin el al. ( 1 989). in which 
ubiquitin/human y-interferon and ubiquitin/ai-proteinase inhibitor were highly expressed and 
quantitatively cleaved to yield y-IFN and ccl-PI with authentic amino termini. 
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Since these early reports, a wealth of studies on ubiquitin fusion expression of various 
proteins in E coli and yeast have been described (Baker et al., 1994; Barr et al., 1991 ; Coggan 
et al., 1995; Gali and Board, 1995; Gehring et al., 1995; Han et al., 1994; Kiefer et al., 1992; Lu 
etal., 1990;Lyttleetal., 1992; Maketal., 1989; McDonnell etal., 1989; McDonnell et al., 1991; 
5 Pilon et al., 1996; Poletti et al., 1992; Rian et al., 1993; Tan and Board, 1996; Welch et al., 
1 995). Very often fusion to ubiquitin led to dramatic enhancement in yield of the fusion protein 
in bacteria, or of the cleaved product in yeast. 

Enhanced expression of foreign proteins by ubiquitin fusion has also been observed in 
plants. In analysis of the promoter of the tobacco polyubiquitin gene, Ubi. U4, by driving 

10 transient expression of the GUS reporter in tobacco protoplasts, Genschik et al. (1994) found 
deletion of the intron sequence from the Ubi. U4 fragment spanning from -263 to the end of the 
first ubiquitin-coding unit had no detectable influence on the GUS activity, but further deletion 
of the ubiquitin-coding sequence diminished the GUS activity by 55%. 

None of these studies has shown the direct enhancing function of the ubiquitin fusion 

15 from a heterologous promoter. Garbarino and Belknap (1994) observed that fusion of the 
promoter plus ubiquitin-coding region of the potato ubiquitin extension protein gene ubi 3 to the 
GUS reporter gene resulted in GUS activity 5- to 10-fold higher than the direct fusion of the ubi 
3 promoter to the GUS gene did in transgenic potato. Again, the synergistic effect of the ubi 3 
promoter and the ubiquitin-coding sequence on the enhanced GUS activity was not excluded. 

20 In another study with a potato polyubiquitin gene, ubi 7, the same group (Garbarino et al., 1 995) 
demonstrated that in transgenic potato plants GUS expression level from the fusion construct 
containing the ubi 7promoter-5' untranslated sequence-intron- first ubiquitin coding unit was 10 
times higher than that derived by only the ubi 7 promoter with the 5' untranslated sequence. 
However, the effects of the intron and the ubiquitin protein fusion in increasing expression level 

25 of the GUS reporter were not clearly discriminated. 

In addition to the above mentioned journal papers, a number of patents related to the 
ubiquitin fusion technology have been filed since 1989. They are shown in Table 1. The 
publications and other materials used herein to illuminate the background of the invention or 
provide additional details respecting the practice, are incorporated by reference, and for 

}0 convenience are respectively grouped in the appended List of References. 
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Table 1 



Patents related to the ubiquil 




ogy. 


Title 


Inventor 


Patent No. 


Filing Date 


Host 
cells 


Generating desired amino- 
tf*rminal r^idn^* in nrotein 


MIT 


WO 8909829 


10/19/1989 




Regulation metabolic stability 


MIT 


US 5093242 


3/3/1992 


mammal, 
yeast 


NlMf*lf*ir nftH crvrwtnict^ malaria 
polypeptides and vaccines 


Chiron 


WO 9208795 


5/29/1992 


yeast 


Production of a protein with a 
predetermined amino-terminal 
amino acid residue 


MIT 


US 5196321 


3/23/1993 


E. coli 


Yeast expression system for 
retinoid-X receptor 


American 

V- y al 1 <U 1 1 1 (J 


EP 608532 


8/3/1994 


yeast 


Recombinant DNA vectors 


Mascarenhas 


WO 9423040 


10/13/1994 


E. coli 


New heat-inducible N-degron 
protein and nucleic acid 
encoding it 


Varshavsky, 

Dohmen, 

Johnston, 

vv u 


WO 9521269 


8/10/1995 




Fusion proteins containing the 
iN-or v^-iennniai ui uuitjuiuii 


Varshavsky, 

JVJllilkllv'lt 


WO 9529195 


11/2/1995 




New fusion protein of ubiquitin 
plant and lytic peptide 


Carbarino, 

Jaynes, 

Relknan 


WO 9603519 


2/8/1996 


plant 


Production of tissue factor 
pathway-inhibitor in yeast cells 


Innis, 
Creasey 


WO 9604377 


2/15/1996 


yeast 


Stable recombinant ubiquitin- 
lytic peptide fusion protein 


J. Jaynes 


WO 9603522 


2/8/1996 


plant 


Fusion protein encoded by a 
gene construct 


Bachmair, 

Finley, 

Varshavsky 


US 5496721 


5/3/1990 


mammal, 
yeast 
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15 



25 



ST TMMARY <">F THF. INVENTION 

In accordance with the present invention a method for enhancing expression of proteins 
in plants or plant cells is achieved by the fusion of a ubiquitin monomer coding sequence to the 
5' end of the coding sequence of the proteins. Expression of the ubiquitin fusion proteins is 
driven by a promoter other than promoters from polyubiquitin protein genes or ubiquitin 
extension protein genes. Thus enhancement of expression level of the proteins is due to the 5" 
terminal addition of the ubiquitin monomer coding sequence. The ubiquitin fusion proteins are 
cleaved at the carboxy-terminal glycine 76 residue of the ubiquitin, presumably by plant 
ubiquitin specific proteases, to produce proteins with desired biological properties. A second 
aspect of this invention is that the N-terminal peptide of 14 amino acid residues of cucumber 
mosaic virus coat protein (NP14) can be used as an N-terminal fusion partner to increase the 
expression level of target proteins in plants. The N-terminal fusion approaches described in this 
invention allow higher yield production of proteins in plants, either in the authentic forms in the 
ubiquitin fusion system or as the fusion protein in the NP14 fusion system. 



RRTF.F DESCRIPTIO N OF THF. FIGURES 

Figure 1 shows the nucleotide sequence and deduced amino acid sequence of tobacco 
ubi.NC89. The nucleotide sequence is listed as SEQ ID NO:l and the amino acid sequence is 
SEQ ID NO:2 in the Sequence Listing. The primers used in PCR are underlined and the mended 
20 37-mer oligonucleotide is double-underlined. 

Figure 2 shows the synthetic DNA coding for the 14 N-terminal amino acids of CMV CP 
(NP14). The nucleotide sequence is SEQ ID NO:3 and the amino acid sequence is SEQ ID 
NO:4. 

Figure 3 illustrates the construction of the ubiquitin-GUS fusion protein expression vector 
pUG. The nucleotide sequence shown for pSKUBCl is SEQ ID NO:5. the sequence shown for 
pB1221 is SEQ ID NO:6, and the sequence shown for pUG is SEQ ID NO:7. 

Figure 4 illustrates the construction of the NP14-GUS fusion protein expression vector 
pCG. The nucleotide sequence shown for pUCG2 is SEQ ID NO:8. 

Fiuure 5 illustrates the construction of the ubiquilin-luciferase fusion protein expression 
,0 vector pUL. The arrow marked in the recognition sequence of Stu I in pBIubi indicates the end 
of the ubiquitin coding region and the cleavage site of the ubiquitin fusion protein. The upper 
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nucleotide sequence shown for pBIubi is SEQ ID NO:9, the lower nucleotide sequence shown 
for pBIubi is SEQ ID NO: 1 0, and the nucleotide sequence shown for pUL is SEQ ID NO: 1 1 . 

Figure 6 illustrates the construction of the NP14-luciferase fusion protein expression 
vector. The nucleotide sequence shown for pCL is SEQ ID NO: 12. 
5 Figure 7 illustrates the construction of ubiquitin-GUS fusion/LUC dual report binary 

vector pUGL121. 

Figure 8 illustrates the construction of the NP14-GUS fusion/LUC dual reporter binary 
vector pCGL121. 

Figure 9 illustrates the construction of the GUS/LUC dual reporter binary vector 
10 pBIL121. 

Figure 10 illustrates the ubiquitin fusion cloning vector pBIubi. The upper nucleotide 
sequence is SEQ ID NO: 13 and the lower nucleotide sequence is SEQ ID NO: 14. 
Figure 1 1 illustrates the NP14 fusion cloning vector pBINP14. 

15 DF.TATT ,F,D DF^RTPTTON OF THE INVEN TION 

The present invention is directed to methods and constructs for enhancing protein 
production in plants. The methods comprise fusing an expression-enhancing nucleic acid at the 
5' terminus of the gene for which enhanced expression is desired. In one aspect of the invention, 
a ubiquitin gene is inserted in front of the gene encoding the desired protein such that a fusion 
20 protein is produced wherein ubiquitin is directly fused to the amino terminus of the desired 
protein. Enzymes such as C-terminal hydrolases, will cleave at the C-terminus of the ubiquitin 
in the fusion protein thereby releasing the desired protein in its natural form as well as forming 
free ubiquitin. The presence of the ubiquitin gene in the resulting fusion protein results in 
enhanced expression of the gene thereby yielding a greater amount of the desired protein product 
than occurs in the absence of the ubiquitin gene. It is necessary to use only the coding portion 
of the ubiquitin gene. The ubiquitin promoter is unnecessary, and the ubiquitin gene fusion can 
be under the control of a heterologous promoter. 

In a second aspect of the invention, enhanced protein production is seen when a nucleic 
acid encoding 14 amino acids of cucumber mosaic virus coat protein is placed in front of the 
gene encoding a desired protein such that a fusion protein is produced wherein the fusion protein 



25 
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includes the 14 amino acids of the cucumber mosaic virus coat protein at the amino terminus of 
the fusion protein. 

The aspects of the invention are set out in the following Examples which are offered by 
way of illustration and are not intended to limit the invention in any manner. Standard 
5 techniques well known in the art or the techniques specifically described below are utilized. 
Techniques such as transfection of protoplasts, preparation of transgenic tobacco plants, 
fluoromelric GUS assays and luciferase assays are well known to those of skill in the art and are 
not described in detail herein. 



10 EXAMPLE 1 

DNA Sequences Coding for the Tobacco 
Ubiquitin and the N-terminal Peptide of CMV Coat Protein 

The coding sequence of the ubiquitin monomer contains 228 base pairs. The 5' part of 

191 base pairs was obtained by polymerase chain reaction (PCR) amplification on the total DNA 

1 5 of Nicoliana tobacum var. NC89 and the remaining 37 base pairs were prepared as a synthetic 
oligonucleotide. An SphI site encompassing the initiation codon ATG and an Ncol site 
following the last codon GGC were created to facilitate cloning. The tobacco ubiquitin coding 
sequence was then cloned into pGEM-5ZF and sequenced. Figure 1 shows the DNA sequence 
and the deduced amino acid sequence of the tobacco ubiquitin. The 76-amino acid sequence is 

20 identical to that derived from a tobacco polyubiquitin gene ubi. U4 (Genschik et al., 1 994). 
However, the nucleotide sequence of the region amplified from the tobacco DNA is different 
from the corresponding regions of all ubiquitin monomers found in ubi. U-f. We have named this 
tobacco ubiquitin coding sequence as ubi.NC89. 

The cucumber mosaic virus coat protein (CMV CP) is encoded by the viral subgenomic 

25 RNA 4 and comprises 218 amino acid residues. The CP gene of the strain CMV-SD was cloned 
by RT-PCR (Guo et al., 1993) and the cDNA sequence encoding the 14 N-termina! amino acids 
(NP 1 4) was either cut out of the CP gene by NcoI/AccI digestion or chemically synthesized. In 
the synthesized version of the NP14 coding sequence (Figure 2), os erhanging adapters for 
Baml II and Sstl sites were attached to the 5'- and 3'-ends, respectively, for easy cloning. 



30 
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EXAMPLE 2 

Translational Fusion Constructs for Transient Expression Assays 

A. Ubiquitin-GUS fusion construct pUG 

The ubi.NC89 sequence was taken from the plasmid pSKUBCl as an Xbal-Ncol (filled- 
5 in) fragment and inserted into the Xbal-BamHI (filled-in) site upstream of the GUS gene in 
pBI22 1 to construct pUG as shown in Figure 3. 

B. NP14-GUS fusion construct pCG 

Plasmid pUCG2 is a derivative of pBI221, in which the ubi.NC89 sequence and the NP14 
1 0 sequence, linked as a read-through ORE, was inserted into the Xbal-Smal sites in front of the 
GUS gene. The ubiquitin moiety was removed from pUCG2 by Xbal-SacII digestion and pCG 
was formed by recircularizing. Figure 4 illustrates these steps clearly. 

C. Ubiquitin-LUC fusion construct pUL 

15 An Ncol (fiHed-in)-Sstl fragment containing the firefly luciferase (LUC) gene was 

inserted into the ubiquitin fusion vector pBIubi (see Figure 10) downstream of ubi.NC89 via the 
StuI-SstI sites in the poiylinker region, resulting in pUL as shown in Figure 5. 

D. NP14-LUC fusion construct pCL 

20 The Ncol (filled-in)-SstI fragment containing the LUC gene was inserted into the NP 1 4 

fusion vector pBINP14 (see Figure 1 1) downstream of the NP14 coding sequence via AccI (or 
Sail which is the equivalent site here) (filled-in)-Sstl sites, resulting in pCL as shown in Figure 

6. 



25 EXAMPLE 3 

GUS/LUC Dual Reporter Constructs for Stable Transformation 
To examine the enhancing effects of the N-terminal addition of the ubiquitin or CMV CP 
NP14 on GUS expression in stably transformed plants, a series of GUS/LUC (test/reference) dual 
reporter constructs were made. Essentially they are based on the fusion constructs used in 
30 transient expression assays, namely, pUG and pCG. The chimeric GUS expression cassettes 
were moved into the plant transformation intermediate plasmid pBI 121, resulting in pUG121 and 
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pCGI21, respectively. The expression cassette of the reference reporter LUC, which was 
constructed by replacing the GUS gene in pBI221 with the LUC gene, was pre-made as a Hindlll 
fragment (Hind[II-35S/LUC/NOS-HindriI) and then inserted into the unique Hindlll site of 
pUG121, pCG121 and pBI121, respectively. The resulting GUS/LUC dual reporter constructs, 
5 pUGL12I, pCGL121 and pBIL121 are shown in Figures 7, 8 and 9, respectively. 



EXAMPLE 4 

Ubiquitin fusion enhances the expression of GUS and LUC in tobacco protoplasts 
The ubiquitin-GUS fusion construct pUG or the control plasmid pBI221 was introduced 

1 0 into tobacco protoplasts derived from tobacco B Y-2 suspension cells, together with a reference 
plasmid FFO which contained LUC gene driven by the 35S promoter. GUS activities were 
determined and normalized by luciferase activities. In four independent transfection 
experiments, the normalized GUS activities (A GUS) from pUG were considerably higher than 
those from pBI221. The averaged increase fold due to the ubiquitin fusion is 6.0 (Table 2). 

1 5 When using LUC as a reporter and GUS as an internal standard as expressed from pBI22 1 , the 
normalized LUC activities from pUL were 1 .37 to 3.1 1 fold higher than those from the control 
plasmid p35SLUC (35S-LUC-NOS) in three independent transfection experiments, with the 
average increase fold about 2 (Table 3). 



20 EXAMPLE 5 

CMV CP NP14 Is a More Efficient Fusion Partner than Ubiquitin 
The enhancing effects of the NP14 fusion on GUS and LUC expression in tobacco 
protoplasts were examined in experiments parallel to the above mentioned ubiquitin fusion study. 
The NP I4-GUS fusion construct pCG produced an average 1 1-fold higher GUS activity than 
25 didpBI221. These results are shown in Table 2. Fusion of NP 14 to LUC increased the LUC 
activity by 2.87 times, calculated by comparing the normalized LUC activity of pCL to that of 
p35SLUC. These results are shown in Table 3. It is apparent that NP14 is a more efficient 
fusion partner than ubiquitin in augmenting GUS and LUC expression in tobacco cells. 



WO 00/36129 



PCT/SG98/00103 



9 

Table 2 

Normalized GUS activities and enhancing fold of the N-terminal fusion constructs 



plasmid 


pBI221 


pUG 


pCG 


activities 


GUS 


A GUS 


E 


AGUS 


E 


1 


293.3 


3760.0 


12.8 


5743.0 


19.6 


2 


206.7 


584.3 


2.8 


940.8 


4.6 


3 


856.7 


3733.8 


4.4 


6708.0 


7.8 


4 


100.0 


408.8 


4.1 


1247.0 


12.5 


average E value 




6.0±2.2 


11.U3.2 



Notes: 1. The normalized GUS activity AGUS is calculated by the formula 

AGUS„= GUS „ x LUC ; ? , 
LUC„ 

where n represents a particular GUS fusion construct, 221 represents pBI221. 

2. The enhancing fold E is calculated as AGUS„ 

GUS„. 
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Table 3 



Normalized LUC activities and enhancing fold of the N-terminal fusion constructs 



Plasmid 


p35S LUC 


pUL 


pCL 


activities 


A LUC 


average 
A LUC 


ALUC 


average 
ALUC 


E 


ALUC 


average 
ALUC 


E 




1 


252 




274 






457 






i 


2 


329 


290 


518 


396 


1.37 


529 


491 


1.70 




1 


169 




556 






701 






2 


2 


ND 


169 


496 


526 


3.11 


886 


794 


4.70 




2 


64 




141 






270 






3 


2 


160 


112 


181 


164 


1.46 


254 


246 


2.20 




3 


ND 




170 






214 






Mean±SE 




1.98±0.56 


2.87±0.92 



Notes: 1 . The normalized LUC activity ALUC is calculated by the formula 

15 ALUC„ « LUC „ x GUSp35SLUC 

GUS„ 

where n represents a particular LUC fusion construct. 



2. The enhancing fold E is calculated as 

20 ALUC n 

LUCp35SLUC . 
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EXAMPLE 6 

Ubiquitin- and NP14-fusion Enhance GUS Expression in Transgenic Plants 
To examine the enhancing effects of the ubiquitin fusion and the NP14 fusion on GUS 
expression in stably transformed plants, three GUS/LUC (test/reference) dual reporter constructs 
5 were made based on the binary vector pBU21. pUGL121, pCGL121 and pBILI2f contained 
expression cassettes ubiquitin-GUS, NP14-GUS and GUS only (control), respectively, and the 
reference LUC expression cassette was integrated in each plasmid (Figures 7-9). Tobacco plants 
transformed with each of the three constructs were prepared and analyzed for GUS and LUC 
activities. Each plant was analyzed twice in two independent experiments and only those plants 

1 0 displaying reasonable consistency of the relative GUS activities (GUS/LUC) in two experiments 
were included for comparison. As shown in Table 4, although variations in the relative GUS 
activities existed among different transformants from the same constructs, the average GUS 
expression level of 5 qualified plants containing the 35S-ubiquitin/GUS fusion construct was 4 
times higher than that derived from 6 plants containing the 35S-GUS construct, confirming the 

1 5 enhancing effect of the ubiquitin fusion on GUS expression as previously observed in tobacco 
protoplasts. Again, the NP14 fusion displayed a higher enhancing effect on GUS expression than 
did the ubiquitin fusion. The average relative GUS activity of 14 pCGL plants was about 7 fold 
that derived from the pBIL121 construct. 

20 EXAMPLE 7 

Ubiquitin fusion and NP14 fusion cloning vectors 
pBIubi (Figure 10) and pBINP14 (Figure 1 1) are two fusion protein expression vectors 
allowing for insertion of target genes downstream of the ubi.NC89 and the CMV CP NP14 
coding sequence, respectively. Both vectors are derivatives of pBI221, with the GUS gene being 

25 replaced by the ubi.NC89 or NP14 coding sequence. In pBIubi, a polylinker sequence was 
attached to the 3' end of the ubi.KC89 sequence and the penultimate codon of the ubi.NC89 was 
changed from GGT to GGA for creating a StuI site in the polylinker region. In pBINP14, two 
cloning sites. Sail (here equivalent to an AccI site) and SstI, are available for cloning the target 
genes downstream from the NP 1 4 sequence (the last 5 base pairs of the NP 1 4 sequence form part 

30 of ihe Sail recognition sequence). In order to use AccI instead of Sail for cleaving pBINPJ4, (he 
AccI site at -393 of the CaMV 35S promoter was eliminated. 
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Table 4 

Effects of ubiquitin- and NP14-fusion on GUS expression in transgenic tobacco plants 



Relative GUS activities: GUS/LUC (pmol MUmm '/cpm x 10" 3 ) 



Plant 


P UGL12I 


P CGL121 


pBIL121 


lines 


exp. 1 


exp. 2 


average 


exp. 1 


exp. 2 


average 


exp. 1 


exp. 2 


average 


1 


12.9 


15.3 


14.1 


2.4 


3.4 


2.9 


1.4 


2.6 


2 


2 


13 


43 


28 


4.5 


6.8 


5.65 


5.2 


2.4 


3.8 


3 


0.7 


0.5 


0.6 


63.2 


9.5 


36.35 


4.2 


0.6 


2.4 


4 


0.3 


0.4 


0.35 


26.9 


8.3 


17.6 


2.5 


5.4 


3.95 


5 


4.8 


0.8 


2.8 


17.8 


22.2 


20 


0.4 


0.38 


0.39 


6 








2.1 


5 


3.55 


0.5 


0.82 


0.66 


7 








4.6 


5.8 


5.2 








8 








58.7 


20.2 


39.45 








9 








15.6 


3.6 


9.6 








10 








17.2 


4.4 


10.8 








11 








3 


1.4 


2.2 








12 








17.9 


24.2 


21.05 








13 








20.7 


19.4 


20.05 








14 








13.7 


25.3 


19.5 








Mean 
±SE 


9.17±5.34 


I5.28±3.18 


2.2±0.61 
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While the invention has been disclosed by reference to the details of preferred 
embodiments of the invention, it is to be understood that the disclosure is intended in an 
illustrative rather than in a limiting sense, as it is contemplated that modifications will readily 
occur to those skilled in the art, within the spirit of the invention and the scope of the appended 
claims. 
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SphI 

GC ATG CAG ATC TTC GTA AAG ACC CTG ACG GGG 
1MQIFVKTLTG 

AAG ACT ATT ACC TTA GAG GTA GAG TCA TCG 
HKTITLEVESS 

GAC ACC ATT GAC AAT GTT AAG GCT AAG ATT 
21DTIDNVKAK I 

CAG GAC AAG GAA GGC ATT CCA CCG GAC CAG 
31QDKEGIPPDQ 

CAG CGG TTG ATT TTC GCA GGT AAG CAG CTT 
41QRLIFAGKQL 

GAG GAT GGC CGA ACA CTA GCT GAC TAC AAC 
51EDGRTLADYN 

ATC CAG AAG GAG TCC ACT CTC CAT CTC GTC 
61 I QKESTLHLV 

TTA AGA CTC CGC GGT GGC CATGG 
71 L R L R G G 

Ncol 

FIG. 1 
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1MDK SE STS 
GATCCATG GAC AAA TCT GAA TCA ACC AGT 
TAC CTG TTT AGA CTT AGT TGG TCA 

A G R N R R 14 
GCT GGT CGT AAC CGT CGA CGAGCT 
CGA CCA GCA TTG GCA GCT GC 

ACCI SstI 



FIG. 2 
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SEQUENCE LISTING 

<110> Fang, Rong-Xiang 

<120> ENHANCED PROTEIN PRODUCTION IN HIGHER PLANTS BY 

N- TERMINAL FUSION OF A UBIQUITIN OR A CUCUMBER MOSAIC 
VIRUS COAT PROTEIN PEPTIDE 

<130> 2248-109 

<140> 
<141> 

<160> 14 

<170> Patentln Ver. 2.0 

<210> 1 

<211> 235 

<212> DNA 

<213> Nicotiana tabacum 

<220> 

<221> CDS 

<222> (3) . . (230) 

<220> 

<223> Modified from wild-type to insert an SphI site in 
the region encompassing the initiation codon ATG 
and to insert an Ncol site following the last 
codon GGC. 

<400> 1 

gc atg cag ate ttc gta aag acc ctg acg ggg aag act att acc tta 4 7 

Met Gin He Phe Val Lys Thr Leu Thr Gly Lys Thr He Thr Leu 
15 10 15 

gag gta gag tea teg gac acc att gac aat gtt aag get aag att cag 95 
Glu Val Glu Ser Ser Asp Thr He Asp Asn Val Lys Ala Lys He Gin 
20 25 30 

gac aag gaa ggc att cca ccg gac cag cag egg ttg att ttc gca ggt 14 3 
Asp Lys Glu Gly He Pro Pro Asp Gin Gin Arg Leu He Phe Ala Gly 
35 4 0 4 5 

aag cag ctt gag gat ggc cga aca eta get gac tac aac ate cag aag 191 
Lys Gin Leu Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn He Gin Lys 
50 55 60 

gag tec act etc cat etc gtc tta aga etc cgc ggt ggc catgg 235 
Glu Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly 
65 70 75 
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<210> 2 
<211> 76 
<212> PRT 

<213> Nicotiana tabacum 
<400> 2 

Met Gin He Phe Val Lys Thr Leu Thr Gly Lys Thr He Thr Leu Glu 
15 10 15 

Val Glu Ser Ser Asp Thr He Asp Asn Val Lys Ala Lys He Gin Asp 
20 25 30 

Lys Glu Gly He Pro Pro Asp Gin Gin Arg Leu He Phe Ala Gly Lys 
35 40 45 

Gin Leu Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn He Gin Lys Glu 
50 55 60 

Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly 
65 70 75 



c210> 3 
<211> 53 
<212> DNA 

<213> cucumber mosaic virus 

<220> 
<221> CDS 
<222> (6) . . (47) 

<400> 3 

gatcc atg gac aaa tct gaa tea acc agt get ggt cgt aac cgt cga 47 

Met Asp Lys Ser Glu Ser Thr Ser Ala Gly Arg Asn Arg Arg 
15 10 

cgagct 53 



<210> 4 
<211> 14 
<212> PRT 

<213> cucumber mosaic virus 
<400> 4 

Met Asp Lys Ser Glu Ser Thr Ser Ala Gly Arg Asn Arg Arg 
15 10 



<210> 5 
<211> 13 
<212> DNA 

<213> Plasmid pSKUBCl 
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<220> 

<221> misc_feature 
<222> ()..) 

<223> Joining region of fusion of two genes. 
<400> 5 

ggccatggac aaa 13 



<210> 6 
<211> 33 
<212> DNA 

<213> Plasmid pBI221 
<220> 

<221> misc_feature 
<222> (1) . . (33) 

<223> Joining region between 35S promoter and GUS gene. 
<400> 6 

tctagaggat ccccgggtgg tcagtccctt atg 3 3 



<210> 7 

<211> 18 

<212> DNA 

<213> Plasmid pUG 

<220> 

<221> misc_f eature 

<222> (1) . . (18) 

<223> Joining region of fusion of genes. 
<400> 7 

ggccatggat ccccgggt 18 



<210> 8 
<211> 18 
<212> DNA 

<213> Plasmid pUCG2 
<220> 

<22l> misc_f eature 
<222> (1) . . (18) 

<223> Joining region of fusion of genes. 
<400> 8 

ctccgcggtg gcatggac 18 



<2io> 9 
<211> 29 
<212> DNA 

<213> Plasmid pBIubi 
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<220> 

<221> misc_f eature 
<222> (X) . . (29) 

<22 3> Joining region between promoter and fused gene. 
<400> 9 

tctagaacta gtggatccct ggcatgcag 29 



<210> 10 
<21l> 35 
<212> DNA 

<213> Plasmid pBIubi 
<220> 

<221> misc_f eature 
<222> (1) . . (35) 

<22 3> Final 2 codons of the ubiguitin gene followed by 
polylinker sequence. 



<210> 11 

<211> 12 

<212> DNA 

<213> Plasmid pUL 

<220> 

<221> mi sc_f eature 
<222> (1) . . (12) 

<223> Joining region between fusion of genes. 
<400> 11 

ggaggcatgg aa 12 

<210> 12 
<2ll> 12 

<212> DNA 

<213> Plasmid pCL 

<220> 

<221> misc_feature 
<222> (1) . . (12) 

<223=> Joining region between fusion of genes. 



<400> 10 

ggaggcctgt cgactcgagc ccgggtaccg agctc 



35 



<400> 



12 



cgtcgcatgg aa 



12 
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<210> 13 
<211> 29 
c212> DNA 

<213> Plasmid pBIubi 
<220> 

<22l> misc_f eature 
<222> (1) - . (29) 

<223> Joining region of fusion of promoter and gene. 
<400> 13 

tctagaacta gtggatccct ggcatgcag 29 



<210> 14 
<211> 35 
<212> DNA 

<213> Plasmid pBIubi 
<220> 

<221> misc_feature 
<222> (1) . . (35) 

<223> Joining region with multicloning sequence between 
fusion of gene and terminator. 



<400> 14 

ggaggcctgt cgactcgagc ccgggtaccg agctc 



35 
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WHAT IS CLAIMED IS : 

1. A method for enhancing production of a desired protein in a plant cell or a plant which 
comprises inserting a first nucleic acid upstream of a second nucleic acid to form a fused 
nucleic acid wherein said first nucleic acid encodes a ubiquitin monomer and wherein 
said second nucleic acid encodes said desired protein and further wherein said fused 
nucleic acid encodes a fusion protein and further wherein expression of said fusion 
protein is not under the control of a ubiquitin promoter. 

2. The method of claim 1 wherein said ubiquitin monomer consists of SEQ ID NO:2. 

3. The method of claim 1 wherein the carboxy terminus of said ubiquitin forms a peptide 
linkage with the amino terminus of said desired protein. 

4. The method of claim 1 wherein said first nucleic acid comprises bases 3-230 of SEQ ID 
NO:l. 

5. The method of claim 1 wherein said fused nucleic acid is under the control of a 35S 
promoter. 

6. A method for enhancing production of a desired protein as part of a fusion protein in a 
plant cell or a plant which comprises inserting a first nucleic acid upstream of a second 
nucleic acid to form a fused nucleic acid wherein said first nucleic acid encodes a protein 
of SEQ ID NO:4 and wherein said second nucleic acid encodes said desired protein and 
further wherein said fused nucleic acid encodes said fusion protein. 

7. The method of claim 6 wherein the carboxy terminus of said protein of SEQ ID NO:4 
forms a peptide linkage with the amino terminus of said desired protein. 

8. The method of claim 6 wherein said first nucleic acid comprises bases 6-47 of SEQ ID 
NO:3. 
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9. The method of claim 6 wherein said fused nucleic acid is under the control of a 35S 
promoter. 

10. A nucleic acid vector capable of transforming a plant cell wherein said vector comprises 
nucleic acid which encodes a fusion protein wherein said fusion protein comprises a 
ubiquitin monomer linked to a protein of interest and further wherein expression of said 
fusion protein is not under the control of a ubiquitin promoter. 

1 1 . The vector of claim 1 0 wherein said ubiquitin consists of SEQ ID NO:2. 

12. The vector of claim 10 wherein said ubiquitin is linked in a peptide linkage at its carboxy 
terminus to the amino terminus of said protein of interest. 

13. The vector of claim 10 wherein said nucleic acid is under the control of a 35S promoter. 

14. The vector of claim 10 wherein said vector comprises bases 3-230 of SEQ IDNO:l. 

15. A nucleic acid vector capable of transforming a plant cell wherein said vector comprises 
a nucleic acid which encodes a fusion protein wherein said fusion protein comprises a 
protein of SEQ ID NO:4 linked to a protein of interest. 

16. The vector of claim 15 wherein said protein of SEQ ID NO:4 is linked in a peptide 
linkage at its carboxy terminus to the amino terminus of said protein of interest. 

1 7. The vector of claim 15 wherein said nucleic acid is under the control of a 35S promoter. 

18. The vector of claim 15 wherein said vector comprises bases 6-47 of SEQ ID NO:3. 

1 9. A plant cell or a plant comprising the vector of claim 1 0. 

20. A plant cell or a plant comprising the vector of claim 1 5. 
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21. A nucleic acid comprising SEQ ID NO: 1 . 

22. A nucleic acid consisting of SEQ ID NO: 1 . 

23. A nucleic acid comprising SEQ ID NO:3. 

24. A nucleic acid consisting of SEQ ID NO:3. 

25. A protein comprising SEQ ID NO:2. 

26. A protein consisting of SEQ ID NO:2. 

27. A protein consisting of SEQ ID NO:4, 

28. A fusion protein wherein said fusion protein comprises a ubiquitin monomer at the amino 
terminus of said fusion protein and wherein said fusion protein comprises a second 
protein at its carboxy terminus. 

29. The fusion protein of claim 28 wherein said ubiquitin monomer consists of SEQ ID 
NO:2. 

30. The fusion protein of claim 28 wherein the carboxy terminus of said ubiquitin monomer 
forms a peptide linkage with the amino terminus of said second protein. 

31. A fusion protein wherein said fusion protein comprises a protein of SEQ ID NO:4 at the 
amino terminus of said fusion protein and wherein said fusion protein comprises a second 
protein at its carboxy terminus. 

32. The fusion protein of claim 31 wherein the carboxy terminus of said protein of SEQ ID 
NO:4 forms a peptide linkage with the amino terminus of said second protein. 



